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Method for Chan nel Estimation and Symbol Detector for Optical Receiver 

The present invention pertains to high-speed optical fiber communication systems 
and in particular to methods for Channel estimation and symbol detectors for optical 
receivers, for improving the bit error rate (BER) when detecting symbols received via 
5 channels suffering from both severe signal distortion causing undesired inter-symbol 
interference of several symbols and from severe noise. 

According to a specific aspect, the invention pertains to the preamble parts of claims 
1 and 21 which is known from US 5,313,495, "Demodulator for symbols transmitted 
over a cellular channel" and US 5,253.053, "Fractionally spaced maximum likelihood 
10 sequence estimation receiver". 

According to another aspect, the invention pertains to the preamble parts of claims 2 
and 22 which is known from W. Sauer-Greff, A. Dittrich, et al. "Adaptive Viterbi 
Equalizers for Nonlinear Channels" SIPCO, 2000', 25-29, (later referred to as 
"SauerOO"). 

15 In a digital communication system symbols are transmitted, where typically a 
number of 2" symbols are used. In the binary case (n=1), there are two different 
symbols, designated logic 0 (zero) and logic 1 (one). 

High-speed optical fiber-communication systems comprise in particular -Single^- - : 
channel systems including SDH and SONET, DWDM systems, CWDM systems and 
20 Systems for dynamically switched OTN (G.709 and related). 

A conventional high-speed optical fiber communication system as shown in Fig. 17 
comprises a transmitter 1, an optical channel 4 and a receiver 10. State of the art 
transmitters typically comprise a forward error correction (FEC) encoder 2 and a 
modulator. 3. A state of the art receiver 10 comprises a physical interface 11, a 
25 limiting amplifier (LA) 210, a clock and data recovery circuit 211 and a FEC decoder 
18. 

At the receiver side of the optical link the optical signal comprising received analog 
data r(t) is input into receiver 10. Receiver 10 comprises physical interface 1 1 which 
performs an optical-to-electrical (O/E) conversion. The analog electrical signal is 
30 input into limiting amplifier 2ia"Both, the physical interface 10, limiting amplifier 210 — 



and CDR circuit 211 have an upper cut-off frequency. Both cutoff frequencies are 
usually significantly higher than 1/(2T), T being the symbol period, in order to keep 
inter symbol interference low. On the other hand too much excess bandwidth in 
excess over the required minimum picks up more noise from the optical link 4, which 
s degrades the receiver performance by increasing the bit error rate. In typical 
receiver designs an excess bandwidth of 50% to 100% is therefore provided (S. U. 
H. Qureshi, "Adaptive equalization", Proc. IEEE. Vol. 73, 1985, pp. 1349-1387, later 
referred to as "Qureshi85"). 

The optical link comprises optical fibers, which attenuate the optical signal and in 
10 addition constitute a dispersive channel. In order to compensate for the attenuation 
the optical link may comprise optical amplifiers comprising Erbium-doped fibers, 
which add noise to the optical signal thereby degrading the signal-to-noise ratio. 

In state of the art dense wave division multiplexer (DWDM) systems the optical 
signal suffers from sever signal distortions that are caused by chromatic dispersion 
15 or group velocity dispersion (GVD), polarization mode dispersion (PMD), self-phase- 
modulation (SPM), four-wave-mixing (FWM), cross-phase modulation (XPM), and 
polarization dependent loss (PDL). These kinds of distortions cause inter-symbol 
interference (ISI). 

Conventional_-receivers=for=highrspeed fiber-optical communication systems, employ 
20 a decision circuit that operates only under "open eye" conditions, i.e. when the "eye 
diagram" at the decision circuit allows a choice of sampling phase and threshold 
such that a hard binary symbol decision can be made with sufficiently low error rate 
(cf. E. Voges, K. Petermann, (Eds.), "Optische Kommunikationstechnik", Springer, 
Berlin Heidelberg, 2002; G. Keiser, "Optical Fiber Communications", 3rd ed., 
25 McGraw-Hill, 2000; G. P. Agrawal, "Fiber-Optic Communication Systems", 2nd ed., 
WileyrNew^orkr-1 997): 

Moreover optical links suffer from varying received optical power by tens of dB and 
impe rfect e. g. band-lim ited or chirped transmitters. In ad dition comm unication 
channels may be time-variant and ensemble-variant, which results in varying 
30 distortion, ISI, noise and optical effects. 



It .s well known that a maximum-likelihood sequence detector (MLSD) is th. 
optimum detector if the receiver has perfect knowledge about the channel However 
.nterestingly, K. M. Chugg, A. Polydoros showed ("MLSE for an Unknown Channel - 
Part I: Optimality Considerations", IEEE Trans. Commun. Vol. 44, 7, 1996 836-846 
5 later referred to as "Chugg96») that there is no well-defined jointly optima.' estimate 
of both an unknown linear channel and data sequence in the maximum-likelihooc 
sense. Hence, following the de-facto conventions in the literature e.g. M Gosh C L 
Weber, JVIaximum-likelihood blind equalization", Opt. Eng. Vol 31 6 1992 1224' 
1228 (later referred to as «Gosh92»), in the following the term -optimal- or "optimized" 
10 .s used in a somewhat loose serise. What is meant is that a solution of minimized 
BER ,s sought within some practical framework or solution space, not excluding the 
case that in a slightly modified framework even lower BER might be achieved. 

MLSDs are mostly implemented using the Viterbi algorithm (VA), originally proposed 
by A. J. Viterbi in "Error Bounds for Conventional Codes and an Asymptotically 
15 Optimum Decoding Algorithm- (IEEE Trans. Inf. Theory, IT-13. pages 260 to 269 
Apnl 1967) for decoding convolutional codes (confer Shu Lin, Daniel J. Costello Jr ' 
"Error Control Coding: Fundamentals and Applications" Prentice-Hall, Inc ' 
Englewood Cliffs, New Jersey 07632, 1983). 

The Viterbi algorithm may also be used for channel equalization in order to cope 
20 w,th ISI. Oh a BinarVTSTch-a-ntie« a channel memory of m bits, tfBSK^W' 
corresponding to all possible bit sequences of length m and 2 transitions entering 
and leaving each state, i.e. there are 2^ transitions between successive stages or 
time units in the trellis. 

in an initializing step of the Viterbi algorithm beginning at an initial stage a path 
25 metric for a single path entering each state at the initial stage is computed. Each 
trans,t,on between states corresponds to a symbol. The path and its path metric is 
stored for each state. " " 

The ViterbLalgorit^ a repeating step jn tne repeating step ^ 

path metric for all the paths entering a state at a stage is computed by add^the " 
30 branch metric entering that state to the metric of the connecting survivor at the state 
of the preceding stage, ftrejch state the path with the largest path metric, called 



survivor path, is stored together with its path metric, and all other paths are 
eliminated. 

A log-likelihood function log P(r|v) is called the metric associated with the path v and 
is denoted M(r|v). The metrics to be chosen depend on the properties of the 
5 transmission path. They may e.g. be obtained from measurements of signal 
statistics (noise), or from a priori knowledge. The metrics may be assumed to be 
time invariant and listed in a look-up table for each transition from one state to 
another for a special application or it may be obtained from on-line measurements 
which will be explained in more detail in connection with SauerOO. 

10 More recently, it became desirable to operate optical links under conditions in terms 
of distortions and noise that would lead to a closed eye due to ISI at the detection 
circuit (cf. e.g., ECOC and OFC, annual conferences). Most approaches to solving 
this problem use either optical or electrical equalizers to compensate ISI in order to 
"open" the closed eye at the detection circuit. These approaches are mostly based 
15 on sub-optimal methods of equalization, namely linear' feed-forward equalization 
(FFE), decision-feedback equalization (DFE), or a combination of both, FFE and 
DFE (Cf. e.g. K. AzadeL et al., Equalization and FEC Techniques for Optical 
Transceivers", IEEE J. Solid State Circuits, Vol. 37, 3, 2002, 317-327 23; Bohn, 
Mohs, et al., "An Adaptive Optical Equalizer Concept for Single Channel Distortion 
20 Compensation",' ECOC TZutffTTC^SHait et al., "Adaptation of "Ele"ctiWfc "PMD i- 
Equaliser Based on BER Estimation Derived From FEC Decoder, ECOC 2001 (later 
referred to as "StichtOI"); F. Buchali, H. Bulow, W. Kuebart, ..Adaptive Decision 
Feedback Equalizer for 10 Gbit/s Dispersion Mitigation", ECOC 2000; S. Otte, W. 
Rosenkranz, "Performance of Electronic Compensator for Chromatic Dispersion & 
25 SPM", ECOC 2000; H. Bulow, F. Buchali, G. Thielecke, "Electronically Enhanced 
Optical PMD Compensation", ECOC 2000; H. Bulow, "Electronic PMD Mitigation - 
from Linear Equalization to Maximum-Likelihood Detection"7OFC'20D"1T 

Only some publications (e.g. US 2002/0080898 A1, "Methods and systems for DSP- 
based Receivers a -and-H— Haunstein, A. Dittrich et al. B lmplementation-of-near- 
30 optimum electrical equalization at 10 Gbit/s", ECOC 2000, later referred to as 
"HaunsteinOO") either discuss or investigate the use of a theoretically optimum 
ma yj m nm-iikPiihnnd-teceiver.-which. -in this context, is also referred-to-as-MLSE— 
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equalizer. Strictly speaking, there is no explicit equalization step in a MLSD It is 
however, conventional to call a MLSD a Viterbi equalizer. However these 
approaches are rarely used in practice, probably because this receiver type is 
commonly believed to be too complex as outlined in QureshiSS, in particular p 1370 
5 And notably, in optical communications, MLSD receivers so far have only been 
discussed as symbol spaced solutions, which are believed to have no severe 
sampling phase dependence (cf. StichtOI). 

With the advent of the so-called optical transport network (OTN), forward error 
correction (FEC) schemes are now conventionally used to provide improved 

10 tolerance against noise resulting from both optical amplifiers in the transmission 
channel and receiver. E.g. a 16 times interleaved (255,239) Reed-Solomon code 
has been standardized in OTN recommendation G.709 of the ITU (International 
Telecommunication Union). More recently, even stronger FEC schemes are 
conventionally used, which can work with pre-decoding-BER as high as e g 10* (Cf 

15 ECOC and OFC). When BER estimation is used fo/optimization of some receiver 
parameters, this BER estimation is conventionally computed in a FEC decoder (e o 
StichtOI). * * a " 

In addition to the usual white Gaussian noise model, in optical receivers noise 
correlation occurs (caused e.g. Jby noise coloring in receive filters or by DWDM 

20 channel interferes), arid^ors¥-mIrbT sighaWependent. especially in ' o?ticW" 
amplified systems. For MLSD receivers, noise correlation and signal dependent 
no.se can be handled for special noise models, as is discussed in the context of 
magnetic recording (A. Kavcic, J.M.F. Moura, The Viterbi Algorithm and Markov 
Noise Memory", IEEE Trans. Inform. Theory, IT-46, 1, 2000. 291-301 (later referred 

25 to as KavcicOO); A. Kavcic, J.M.F. Moura, "Correlation-sensitive adaptive sequence 
detection", IEEE Trans. Magn., Vol. 34, 1998, 763-771 (later referred to as 
Kavcic98)), albeit restricted to the symboi-spaced receivers and to Gaussian noise 
processes. 

Conventional high-speed-clock-recovery-circuits for broadband systems-can-even- 
so fail for severely distorted signals under noise. However, when they work, they will in 
general provide a sub-optimum sampling phase, which calls for controlled sampling 
phase adjustment-as-is-shown-i n _e.g.-Sticht01. For carrying out this--invention-it-is_ 
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assumed that some state of the art clock recovery subsystem (see Fig. 2, CR 14) is 
available that is able to recover a clock with approximately fixed but otherwise I 
arbitrary phase relation to the transmit clock. The remaining, non-trivial, problem 
then is to find a sampling phase that leads to minimum BER or to near-minimum 
5 BER. Especially for distorted input signals, it is neither assumed nor required that 
the raw sampling phase as recovered by clock recovery is a BER-optimal sampling 
phase. 

Unlike e.g. In mobile wireless communication, in optical communications the receiver 
must often adapt to the received signal without the use of a training sequence. 

10 Moreover, several effects causing distortion are significantly time-variant albeit not ! 
extremely fast. In essence, the receiver is faced with a difficult adaptive blind 
equalization problem i. e. both the transmitted data sequence and the transmission 
channel properties are unknown. In principle, there are several known approaches 
to the blind equalization problem e. g. the Bussgang algorithm, Higher-Order 

15 Statistics and joint channel and data estimation, all basically' using a nonlinearity to 
generate some substitute for the missing reference or training signal. All except the 
probabilistic estimation methods make use of a linear filter model of the channel. 

According to Simon Haykin, "Adaptive Filter Theory", 4 th edition, Prentice-Hall, 2001 , 
for blind equalization the problem of ensuring convergence to the global BER 

20 minimum is an open prbblenT^TotoW blind equalization solutions that" 

have been described for certain, e.g. wireless applications, the so-called sequence 
feedback (cf. J. W. M. Bergmans, "Digital Baseband Transmission and Recording", 
Kluwer Academic Publishers, Dordrecht, 1996) or PSP approaches (Chugg96; R. 
Raheli, A. Polydoros, C.-K. Tzou, „Per-Survivor Processing: A general Approach to 

25 MLSE in Uncertain Environments, IEEE Trans. Commun. Vol 43, 2/3/4, 1995, 354- 
364, later referred to as "Raheli95") to channel estimation, due to its complexity, are 
not suitable for high-speed optical communication receivers. 

An integral part of most equalizer solutions, including MLSD equalizers as disclosed 
e.g. in US 5,313,495 -and-US-5r2637053-is-the concept of an error signal-that-is — 
30 based on synthesizing a desired hypothetical channel response, given a current 
linear channel model estimation, tentative decisions made in the detector and the 
ac tu al'y r^myfiri Rignal-This-hvpothetical- -response and the actually- -received — 
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response are compared and used to derive error signals or decision metrics. Such 
an error signal and the derived metrics then Incorporate mainly the effects of noise 
plus the effects of residual mis-equalization i.e. of Imperfections of the channel 
model. Residual mis-equalization is sometimes referred to as convolutional noise 
5 However, as discussed e.g. in HaunsteinOO, an explicitly linear channel model is 
fundamentally inappropriate for the nonlinear optical channel employed in intensity- 
modulated signaling with direct-detection square-law receivers. 

It is still believed that a training sequence is required in optical communications for 
channel acquisition (HaunsteinOO; A. Dittrich, M. Siegrist, W. Sauer-Greff R 
iDUrbansky, .Iterative Equalization for Nonlinear Channels with Intersymbol 
Interference", Kleinheubacher Berichte, 2001). 

In contrast to most estimation methods that estimate the parameters of an explicit 
filter channel model, EP 1 139 619 A1, "Channel estimation using a probability 
mapping table" describes a very interesting implicit channel /estimation method for 

15 sequence estimation, based on histogram sets: These histograms represent sample 
amplitude statistics conditioned on channel state and are used to derive branch 
metrics for a MLSD. The scheme described, however, is limited to symbol spaced 
processing and describes only the case of a sample depending on preceding 
symbols. Moreover, the application fails to disclose any suitable method for 

20 initializing such a receiver <blirid ; acqui^ the need for training. 

More specifically, SauerOO discloses an adaptive Viterbi equalizer for non linear 
channels. For white noise and equally probable symbol sequences an MLSE 
minimizes the sequence error probability. The received analog input signal is 
sampled at the symbol rate T after analog processing by a matched filter. It is 
25 assumed that consecutive samples are statistically independent and a sample 
depends on L+1 symbols only due to ISI. • The metric increments being equivalent to 
the logarithm of channel transition probabilities describe the statistical properties of 
the transmission channel and do not depend on assumptions like linearity or 

- Gaussian probability density-functlon;-they-are--pre-computable or result- from- 
30 measurement. A look up table may be provided which is addressed by q quantized 

input bits and L+1 bits for the channel state in order to obtain the metric increment. 

- The look up table may -be-based-on-measurements. The probabilities can-be- 
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approximated by relative frequencies of occurrence, i.e. the number of occurrence of 
the event, that for a certain channel state (current and L previous symbols) the 
sampled output is within a quantization interval associated with q quantized input 
bits per total number of trials. After a sufficient long accumulation period the 

5 logarithms of the event counts normalized to the accumulation period yield, the look 
up table entries. Thereby precaution against zero event counts has to be 
implemented e.g. by interpolation. To set up the look up table for unknown 
channels, a known training sequence, addressing all different channels states has to 
be transmitted for a sufficient long peridd. To update the conditional probabilities 

10 during normal data transmission, memory cells are addressed using the estimated 
states resulting from the MLSE output. 

HaunsteinOO and EP 1 139 619 A1 written partly by the same authors or inventors, 
respectively disclose similar subject matter. 

Kavcic98 discloses both "leading and trailing isi lengths" corresponding to pre- and 
15 postcursor symbols in the context of magnetic recording restricted to the symbol- 
spaced receivers and to Gaussian noise processes. 

US 5,313,495 discloses a demodulator for symbols transmitted over a digital cellular 
channel. It comprises a MLSE which is implemented using a Viterbi algorithm. 
^-^Gellulac. channels suffer from multi-path=fading.==3;he--^iterbi,.equalizer.:may require 
20 excessive computation overhead when estimating symbols which are subject to an 
ISI. In cellular communication systems, because geographic changes of the 
transmitter are frequent and unpredictable, fading and ISI become excessive and 
the use of a Viterbi equalizer requires that an algorithm be employed which 
implements 16 or 64 states. A simpler four-state Viterbi equalizer using a first order 
25 least means square channel estimator only marginally meets the BER requirements 

forthe-cellularsystemr J Thehigher-order^such^s-16-or-64-stateJ/jterbLequalizemiIl — 

require a prohibitive amount of computation. Therefore a four state Viterbi equalizer 
is provided together with over sampling the signal at twice the normal symbol rate. 

In -addition to calculating the branch-metrics-based- on two samples, channel -• 

30 estimation is based on over sampled symbol data. 

Also US 5,263,053 discloses a fractionally spaced m aximum likelihood sequence 
estimation receiver. An embodiment is descnoed in connection with 7t/4-shifteU — 
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deferential quadrature phase-shift-keying (k/4-DQPSK) transmission which has bee, 
proposed for digital transmission using oelluiar telephones. Due to the muM-pat. 
ctaracterisflc IS. distortion and noise corruption do ooour. For toe MLSE toe iTrb 
algorithm is used. The state of a channel can be though of representing^ 
s symbois toe, have been applied to „ a, any particular time where the channe 
memory length Is L symbol periods before toe present symbo, Two fold ov- 
sampling is performed. 

It is the object of this invention to optimize toe bi, error rate of a digital signal 
10 received via an ISI-impalred, noisy channel. 9 

This object is achieved by toe subject matters of the Independent claims. 

Purred embodiments of toe inventton era toe subject matter of toe dependent 

-It is advantageous to decide a symbol wift toe he,p'of both, pracursor and 
slnZLtTT *"* **" d,3P9r3k>n °' *« which may be 

ZbT 8 Symb0 ' to ^ ^ b ° th ' Pre0ed,n « and **>wing 

A fractionally spaced maximum-Hkelihood "s^uence 'd^tor fe advantageous for 
compensating inter symbol interference since toe various ktods of disperaions o, 

sy^Tota 3 0On " nU ° US bmadenin9 ° f ° ne m ' nto * e 

The various kinds of dispersions of toe optica, channe, result in a continuous 
broadening of toe symbols and consequently in at. Especially in connecBon with 
excess bandwidth a fracUonal IS. compensation provides better performance 
25 requires fewer symbols to be allowed for In toe ISI compensation and as a 
_S=5££ requires iess computajfonra^^ an equlva|en , 

Obtaining branch metrics from detected symbols advantageously automatically 
— adapts-the branch metrics to the channet-aetually-used^-So tois way of updating the 
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branch metrics provides a practical solution for the blind acquisition problem of optical 
channels. 

Fitting a model distribution to counter values proportional to the frequencies 
measured eliminates runaways and in particular eliminates counter values of 0 
5 which have to be replaced by a low value in order to avoid an error when a logarithm 
is calculated therefrom in order to obtain the branch metrics . Two-fold over- 
sampling is feasible even in high frequency applications and results only in a 
moderate cost increase compared to symbol sampling. 

In an embodiment each event is defined by a channel state and a digital word 
10 independently of the sampling phase or time. This on the one hand ignores the 
relation between the sampling phases and on the other hand leads to a simple 
implementation. Moreover, counting each kind of events results in higher counter 
values compared to more sophisticated methods. In particular the later point results 
in better statistics even after short accumulation periods: ' 

15 Distinguishing between a first kind of events relating to a 'first sampling time and a 
second kind of events relating to a second sampling time -conditioned on the digital 
word measured at the first sampling time accounts for the correlation of sampling 
values for the same symbol obtained at different sampling times. 

7 'in'ordtTtolmprove the sampling statistics fo7 me counter values relating to the 
20 second sampling phase, the digital words obtained at the first sampling phase may be 
grouped into subsets. The number of subsets Is smaller than the number of possible 
digital words and a so-called coarse digital word is associated to each subset. In this 
embodiment a second event is defined by a channel state, a digital word obtained at 
the second sampling phase conditioned on a coarse digital word. 

25 m another embodiment, wfiicfi-allows tor the-correlation-of-the-different-samples- 
obtained for the same symbol, only one kind of events is counted which is defined by 
a channel state and a digital word for each sampling phase. In the case of two-fold- 

over .samp ling this embodiment only requires as much-counts"'" * are required for 

counting the second kind of events for example obtained at the second time 
30 conditioned on the sample obtained at the first sampling time. Moreover, the branch 
metric - calcu l ated from tho count o r va lues -c o nstitntft-tntar-branch metrics. So this 
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embodiment does not require an addition of sample branch metrics in order to obta 
(total) branch metrics. 

Providing, the adjustment of sampling times into a quasi-continuous delay of the 
sampling clock and a discrete sampling phase adjustment shortens the delay range 
5 for the sampling clock and thereby is much easier to implement. 

Proper adjustment of the sampling times or phases lowers the BER. Adjusting the 
sampling phase based on bit error rate estimates leads to an optimum bit error rate 
by definition. 

Also the adjustment of the sampling phase by maximizing a population dlfferenoe 
10 parameter results in a at least near optimum BER. 

An idle period between two conseputive accumulation periods can be reduced r 
additional circuitry is provided for performing the counting 'of each kind of events ir 
parallel to the calculating of branch metrics for the cha'nnel statistics accumulatec 
dunng the previous accumulation period. 

15 Blending the old branch metrics with new branch metrics using a forgetting factor may 
be centered as extending the averaging period over the accumulation period This 
speeds up the adaptation of the branch metrics to new channel conditions because 
-th-ejaccumurat^^ can be shorter than a r^^^^^^, Tnis js jn 
particular relevant for the embodiments counting a large number of events due to bad 

20 statistics. Moreover, such a way of updating reduces the danger of oscillation 
between two independent meta-stable channel models when calculating the branch 
metrics in parallel to the counting. 

Blending old branch metrics with newly calculated branch metrics is mathematically 
less correct, but it is acceptable if only small changes are expected. In this 
25 embodiment it is not necessary to save the old counter values used for calculating the 
old branch metrics. 



Due .to the non-linear nature of any logarithm it Fmore correct to blend old and new 
counter values and calculate the branch metrics from the blended counter values. 
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Setting the branch metrics for the channel states for isolated O's and 1's to identical 
values when initiating the branch metrics constitutes a generic channel model which 
provides good convergent behavior in, both low and high dispersion cases. 

Setting the branch metrics for channel states being symmetrical to each other to 
5 identical values provides a good starting point for dispersion affecting precursor and 
postcursor symbols in a similar manner. 

Monitoring for an abnormally high bit error rate and/or pathological amplitude 
statistics allows re-initialization with a hopefully more appropriate set of branch 
metrics. 

10 In the following preferred embodiments of this invention are described referring to 
the accompanying drawings. In the drawings: ( 

Fig. 1 shows a block diagram of an optical fiber communication system. 
Fig. 2 shows a more detailed circuit diagram of the Clock recovery circuit. 
Fig. 3 shows a bit rearranging circuit. 
15 Fig. 4 shows a trellis for a MLSD. 

., -..-JEig-S illustratesithebraneh metric computation-for different embodiments.. 

Fig. 6 shows a set of frequencies or counter values used for the calculation of 
a channel model. 

Fig. 7 shows the partial branch metrics of a channel model for which the 
20 samples of different sampling phases are assumed to be independent. 

pj g - 8 -g ngW g_thf»-r gr * ia ' hranrh metrics of a channel model s pecific for the 

first sampling phase. 

Fig. 9 shows the partial branch metrics of a c hannel model specific for the 
second sampling phase conditioned on the previous sampling value r n 
25 obtained at the first sampling phase. 
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Fig. 10 shows the branch metrics of a channel model specific for the secom 
sampling phase conditioned on the previous sampling value r, obtained at the 
first sampling phase. 

Fig. 11 shows the partial branch metrics of a channel model specific for the 
second sampling phase conditioned on the previous coarse sampling value 
R(ri) obtained at the first sampling phase. 

Fig. 12 illustrates the application of a channel model. 



metric 
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Fig. 13 illustrates an update cycle for parallel accumulation and branch 
computation. 

Fig. 14 shows a level crossing of isolated ones. 
Fig. 15 shows a starting histogram. 

Fig. 16 shows a method for channel monitoring 'and selection of an 
appropriate starting histogram. 

Fig. 17 illustrates a conventional high-speed optical fiber communication 
system. 



Abbreviations 



a.k.a.: Also known as 

ACS: Add-Compare-Select 

ADC: Analog-To-Digital Conversion 
20 AGC: Automatic Gain Control 

APD: Avalanche Photo Diode 

BER: Bit Error Rate 

BMU: Branch Metric Unit 

BRC: Bit rearranging circuit 
25 CDR : Clock and Data Rec overy 

CE: Channel Estimation 

CR: Clock Recovery 

CWP M: Coarse Wavelen gth Division 
Multiplexing 



30 DFE: Decision-Feedback Equalizer 
DGD: Differential Group Delay 
DLL: Delay Locked Loop 
DMUX: Demultiplexer 
DSP: Digital Signal Processor 

35 DWDM: Dense Wavelength Division 
Multiplexing 
ECOC: European Conference on 

Optical Communication 
FEC: Forward Error Correction 

40 FFE: Feed-Forward Equalizer 
FS MLSE: fr actionally spaced M LSE 
FWD: Frequency Window Detector 
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25 PLL: Phase Locked Loop 
PIN: Positive Intrinsic Negative 

(doping structure) 
PMD: Polarization Mode Dispersion 
PSP: Per-Survivor Processing 
30 r.m.s.: Root mean square 

SDH: Synchronous Digital Hierarchy 
SIPCO: Signal Processing Conference 
SMU: Survivor Metric Unit 
SONET: Synchronous Optical 
35 Network 

SOS: Second-Order Statistics 
SPA: Sampling Phase Adjustment 
SPM: Self-Phase/Modulation 
SW: Software' 
40 TBU: Trace/Back Unit 
TED: Timing Error Detector 
TF: transversal filter 
TIA: Trans-Impedance Amplifier 
VA: Vrterbi Algorithm 
• 45 VGA:*" Variable Gain: Amplifier -= - 
VCD: Voltage Controlled Delay 
VCO: Voltage. Controlled Oscillator 
XpM: Cross-Phase Modulation 

Mathematical Symbols 



at 


source data 


i: current symbols . 


b: 


channel state vector 


60 j: number of postcursor symbols 


BM: 


branch metric 


k: index of consecutively 


c: 


count 


calculated sets of BM 


55 4: 


encoded data 


L: oversampling factor 


f: 


frequency 


1: Indexfor sample within one 


F: 


forgetting factor 


65 symbol; 1 e1...L * 


h: 


number of precursor symoois 
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FWM: Four-Wave-Mixing 
GVD: Group Velocity Dispersion 
HOS: Higher Order Statistics 
HW: Hardware 
5 ISl: Inter-Symbol Interference 
ITU: International 

Telecommunication Union 
LA: Limiting amplifier 
LP. Loop Filter 
10 LOS: Loss Of Signal 

MAP: Maximum A-Posteriori 

Probability 
MLSD:Maximum-Likelihood Sequence 
Detector 

15 MLSE; Maximum-Likelihood Sequence 
. Estimator 
MUX: Multiplexer • 
NRZ: Non-Re*turn-to-Zero 
OSNR: Optical Signal-To-Noise Ratio 
20 OTN: Optical Transport Network 
PD: • Phase^DetectOr - - - 
PDL: Polarization Dependent Loss 
PFLL: Phase / Frequency Locked 
Loop 



M: 
N: 
5 Q: 

q: 

r(t): 
R: 
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s: 



number of states in Viterbi 
detector 

number of symbols defining the 
channel state 
number of distinguished 
quantization values (2 R ) 

0,1 Q-1 

received analog data 
resolution (in bits) of the 
quantizer 

channel state index 



S: 
T: 

y(t): 

15 r u : 

Hi.' 

20 x ( : 



number of different symbols 
symbol time 
sent analog data 

(oversampled) quantized data 
qeO...Q-l 

associated (oversampled) 
quantized data q e 0...Q - 1 
Detected (undecoded) data 
Decoded data 



White the present invention is described with reference to the .embodiments as 
Olustrated in the following detailed description as well as in the drawings, it should be 
2 a underetood that the fbltewing debited description as wen as the drewings are no. 
intended to limit the present Invention to the particular illustrative embodiments 
disclosed, but rether the de described illustrative embodiments merely exemplify the 
various aspects of the present Invention, the scope of which is defined, by the 
appended claims. y e 

30 In particular, the recehrer concept described In this application is motivated by but 
not limited to fiber opticarcimm^nloaiion. It can be applied for any dW&^band' 
communication system with a- P riorl unknown multi-symbol ISI that extends over a 
moderate number of symbols. An optical receiver comprising an Inventive digitizer 
can operate with an acceptable OSNR penalty of below 8dB. At a data rate of 

35 10.7Gblt/ It can wo* e.g. In a range of -3500 ps/nm to 3500 ps/nm residual GVD or 
up to about 240 ps instantaneous (first order) DGD. Severe! sources of distortion 
can occur simultaneously. e.g. GVD combined with PMD, at mutual expense of their 
OSNR penalty contributions. The M=4 receiver wil. wo* as long as dominant (within 
quantaer accuracy) parts of the Impulse response do not spread significantly 

40 beyond a totel vrfdft^hree^mrW periods. This enables up to 200km optically 
amplified metro links without dispersion compensation. ~ 

Fig. 1 shows an optical transmission system. It comprises a transmitter 1, an optical 
channel 4 and-aH=ecelver-1^A4ypioal transmitter 1 comprises an-FEG-encoder-Wor- 
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encoding input data a, in order to generate encoded data d, which is forwarded to a 
modulator 3. The modulator 3 generates an optical signal comprising sent analog 
data y(t) constituting the output of transmitter 1. There is no low-pass filter for 
explicitly band-limiting the spectrum in the baseband before modulation. Neighbour 
5 channels of DWDM systems are. separated in the optical domain by optical 
bandpass filters. 

The optical signal is transmitted via optical channel 4 to receiver 10. 

At the receiver side of the optical link received analog data r(t) is input into receiver 
10. The receiver 10 comprises a physical interface 11. a AGC or variable gain 
10 amplifier (VGA) 12, an ADC 13. a clock recovery (CR) 14. a sampling phase 
adjustment (SPA) circuit 15. an MLSD 17. a FEC decoder 18. a channel statistic unit 
19 and a receiver control node 20. In. addition receiver 10 may comprise a bit 
rearranging circuit (BRC) 16 in particular if the delay range of SPA circuit 15 is 
smaller than a symbol period. 

15 The physical interface 11 performs an optiCal-to-electrical (O/E) conversion. The 
physical interface is a standard PIN or APD optical front-end with trans-impedance 
amplification (TIA). The physical interface also acts as an implicit low-pass filter for 
the received analog data. 

The analog serial signaTdate at 1ST "SSpSf of a PIN or APD optical front-end is 
20 amplified by a high-gain higlvdynamic, low-noise automatic gain control (AGC) 
circuit 12. The output signal of AGC 12 is designated r(t) . The AGC circuit 12 may 
amplify the analog electrical signal to a constant level in terms of peak-to-peak 
voltage, average rectified voltage or root-mean-square voltage. In another 
embodiment the amplification of AGC circuit 12 may be controlled by control unit 
— 25-receiver-control ?n based on q uantized data r, (cf. US 3,931,584. "Automatic 
Gain Control") for fine-grained control of the amplification.. In the latter case it is 
more appropriate to designate unit 12 as a variable gain amplifier (VGA). The 
control of the VGA_may r .b.e.based.on_frequencies of peak digital, values jUcf.-U.S_ 
2002/0113654 A1) or on a frequency of digital values D within a digital value range 
30 (cf . US 3.931 ,584). In another embodiment, a coarse and a fine VGA circuit may be 
provided. These cjrcujtemay be controlled by one of the methods discl osed in co- 
pending European patent application number 03009564.0. "Method Tor comrolnng 
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amplification and circuit", which has also been filed by CoreOptics and i 
7 referenC8 - B8Sed " ^ — * -—^5 oLnn 
3.931584) or cateulate a uniformity parameter, in compiiance with EP03009564 C 
. for adding fhe gain of AGC/VGA circutt 12. in any embodimen, me S 

ssrisr 12 maps - ~ w ° - ~- ° 

^ ADC 13 d^es ft e ana, og signal f(t) and outputs guandzed date B =r, lnde> 

to a^1 ,oTrL^ indeX,ted ^ n,MmPlin9Pte8e8 - ,ndeX "" a V assume the 
SPA ci^t 15 Tt ° VerSamP " n9 - ADC 13 — « a sampiing dock from 
SPA *rcu, 15 whtch in ten, receives a sampling dock from clock recovery 

SPA c«urt 1 5 operates as an adjusteble delay in oroer to optimize the phase of the 
clock which .is to say to optimize the sampling times of ADC 13. 

" luoTvZt *°" n Fi9 ' 1 ^ °" ^ ~" «» "°<* -covery 
subsystem 14 recovers ft. eymbol frequency. Sampling is perfonned either on fte 
*ng or nsing Cock edge of the Cock inputted into ADC 13. „ two « w 
sampling is penned fte Cock recover su.bsys.em14 may also recover tee symbo. 

I k T 9 * L " f0,d ""Wi «-es highl IhaT 

the symbo, frequency is recovered. AHemaaveiy. tor L-told oversampiing. muftpntse 

Lmll " L 7 r mUl,iPhaSe ** ^ mn ° and «*- « — ^ 
. fXS "* Symb0 ' " U2 ** ^ — ^bo, 

The receiver control node 20 in connection with channel statistics unit 19 may 
perform a method similar to the disclosure of WO 02*0035 A1. Alternativeiy 
receiver con.ro. node 20 in connection w«h channel statistics unit 19 and -SPA CrcuK 
15 may perfom, ^RTHw,^^. Eurbpean-paBnt- 
so app„ca*on number 03004079.4. "Seaming method tor adjusted, of a sampling 
Phase in an oversampiing receiver and circuit-, which has also been fried by 
CoreOptics and is iaterraferred to-as^P03004079.4. in particular, from the channel 
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statistics also a population difference parameter may be calculated for performing 
phase adjustment as disclosed in this co-pending European patent application, 
which is incorporated herein by reference. 

Finally the receiver control node 20 may obtain bit error estimates from MLSD 17 or 
5 FEC decoder 18 for optimizing the amplification of AGC/VCA circuit 12 or the. phase 
by controlling SPA circuit 15. Receiver control node 20 may perform a gradient 
search in order to minimize the bit error estimates. As bit error estimate an 
unreliable detection event as described in co-pending European patent application 
number 03002172.9, "Error rate estimation method for a receiver and receiver 
10 apparatus", which has also been filed by CoreOptics, may be used. European 
patent application nUmber 03002172.9 is incorporated herein by reference. In one 
embodiment the ADC 13 has a three bit resolution corresponding to eight 
distinguished quantization levels. In other embodiments the ADC resolution may be 
different e.g. two, four or eight bits corresponding to four, 16 or 256 quantization 
15 levels. . 

The ADC 13 may comprise a single sampler sampling the analog signal at the 
appropriate frequency. The output may be provided serially to MLSD 17. In another 
embodiment, which is compatible with Fig. 1 , the output of an oversampling sampler 
may be demultiplexed and latched for further processing by bit rearranging circuit 
~l»o 16. In another embodiment also compaSHl(B~^tti"1 Fig. 1, one sampler may be 
provided for each sampling phase. Each of the samplers operates at the symbol 
frequency and may latch its output for further processing by bit rearranging circuit 
16. 

The quantized data r M are input into bit rearranging circuit 16 which is explained in 
25 more detail in connection with Fig. 3. Bit rearranging circuit 16 outputs associated 

data r* M in to Ml:SD~17r- MbSD-17-may-implement-a-Viterbi-algorithm-(VA)-and — 

outputs the most likely sequence designated detected data u, to FEC decoder 18. In 
a typical optical receiver, with a powerful FEC code used, the bit error rate at the 

output of MLSD 17 Tanges-eTgr-from-IO^to-about 10" 4 . The subsequent FEC— 

30 decoder 18 further reduces bit error rate to a range between 10" 9 and 10" 16 which is 
required for data transmission. FEC decoder outputs decoded data Xi for further 
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processing. MLSD 17 and/or FEC 18 may obtain BER estimates and provide sarr 
to control node 20. 

Control node 20 receives a loss-of-signal (LOS) signal from physical interface 1 
and may receive counter values or event frequency information from statistic unit 1 
5 in order to obtain pre-processed statistics data for controlling the AGC/VGA circi 
12, CR 14, SPA circuit 15 and bit rearranging circuit 16. 

The clock recovery subsystem 14 is shown in more detail in Fig. 2. It may t 
referred to as a phase / frequency locked loop (PFLL). The clock recove 
subsystem 14 comprises a phase detector (PD) 31, a loop filter (LF) 32, a voltac 
10 controlled oscillator (VCO) 33 and a frequency window detector (FWD) 34. 

Initially, phase detector 31 is disabled and the frequency window detector 34 
active. The clock generated by VCO 33 is compared against a local reference clo< 
CLK REF by a digital edge counting process (Cf. WO 02/30035 A1) performed t 
frequency window detector 34. In this way the frequency window detector 34 draj 
15 the VCO frequency into the target frequency window. When this frequency window- 
reached, FWD 34 is disabled and PD 31 is switched on and locks the clock of VC 
33 in frequency and phase to the received data stream. 

Clock recovery subsystem 14 recovers a frequency and "some" sub-optimal pha* 
with a fixed relation to the transmitted symbol stream. " ~ '"• 

20 For distorted signals, the recovered dock phase in general leads to sub-optimal • 
even very bad BER. 

The sampling phase that is delivered from clock recovery- subsystem 14 
dynamically adjusted in a delay locked loop (DLL), in a continuous or qua$ 
continuous way by a delay signal that is originally derived from the chann 
25 estimation (cf. European patent application number 03004079.4). 

In or der to limit the required range of (quasi-) continuous phase shifting within SF 
circuit 15, which reduces power consumption, there is a discrete l-T/L, 1=1,..., 
phase justification facility performing samples-to-bit synchronization. 
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An implementation of the phase justification facility for two-fold oversampling is the 
bit rearranging circuit shown in Fig. 3. The bit rearranging circuit comprises delay 
element. 41 and multiplexers (MUXs) 42 and 43. The delay element 41 receives the 
a clock CLK output from ADC 13 which indicates when the ADC 13 outputs valid 

5 data. The delay element may be implemented by a flip-flop or shift register, 
depending on the frequency of the clock CLK. If the clock CLK has symbol 
frequency as in the embodiment of Fig. 1 a flip-fop is sufficient. The MUXs 42 and ■ 
43 are controlled via line SEL by control node 20. Either MUXs 42 and 43 output r (>1 
and n.2, respectively or MUXs 42 and 43 output r M .2 and r,.i, respectively, as shown . 

10 in Fig. 3. 

The bit rearranging circuit effectively adjusts the association of L=2 samples to one 
symbol In MLSD 17, at acquisition time. This helps to achieve initially optimum 
sampling phase in the center of the quasi-continuous shifting range. During channel 
tracking only quasi-continuous phase shifting by SPA 15 is use,d, because a large 
15 discrete phase step may lead to a loss of the channel model. _ ( 

The channel estimation is based on decision-directed conditional quantized 
amplitude statistics, conditioned on a channel state as derived from the detected 
data U|. • ■ 

....._.... r . : -..^.A=-.GhanBeL-,state. is characterized by the ,sefeof-,channel=input^symbols -that fully 
20 determine the received noise-less amplitude in a channel with memory. A channel is 
said to have channel memory of m symbols, if the noise-free channel output 
depends on the combination of one "current" symbol and of m other pre- and/or 
post-cursors symbols. In this case, the channel length is m+1. As usual in 
equalization of uncoded sequences, the Channel state can be represented by a 
25 sequence of N symbols. In a binary embodiment a symbol is equivalent to one bit. 

— ■ Tho-ccquen.ce-of-?y m ^ > ^ r ^ m P rises - Qne - CQnsiaered or current symbol b [t h 

precursor symbols b,. h , ... b M , preceding the current symbol b, and j postcursor 
symbols b,+i, ... b, +j following the current symbol (N=h+1+j). Consequently the 

_ channel- state at the current symbol -bi^ca^be-described-by- channel state vector 

30 b,=(bi-h. - b i+J ). Provided that there are S different symbols there are S N different 
channel state vectors. In the embodiments disclosed in connection with Figs. 4 to 
i^a-channel state is defined by 3 consecutive-symbols-and-a symbol corresponds 
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to a bit, which may assume the values 0 or 1. A channel state is encoded into 
transition in the trellis and for each transition a branch metric is calculated; it may t 
said that each possible channel state for a current symbol is tentatively considered. 

In prior art (cf. EP 1 139 619 A1 and US 5,263,053), the channel state is most 
5 described as the current symbol plus a number of precursor symbols and tr 
channel output is said to depend on the current and those previous symbols. This 
misleading, since the best mapping of a channel state to a bit to be decide 
depends on the nature of interference. For symmetrical pulse dispersion, whic 
broadens a symbol to "diffuse" or "flow" into preceding or: following symbols, it 
10 best to decide a symboj.with the help of both precursor and postcursor energy. 

The fractionally spaced MLSD for an.un-coded ISI channel works on a canonic 
symbol-spaced ISI trellis, as shown in Fig. 4. In this trellis, the transitions are labels 
in a three-bit notation that can e.g. be interpreted as a 1 previous, a current, and 
next symbol or bit in time sequence, when read from left to right. The state variable 
15 can be thought of as a shift register where, a new bit mo^es in from the right and ; 
the same time the leftmost bit moves out. : 

However* unlike a symbol spaced MLSD, the fractionally spaced MLSD receive 
L=2 T/L-spaced samples per symbol (bit) to be detected; which changes the we 
—~t4iow metric;iacKements .or branch metrics are-computad^fofeeachipoissibleUcansition. 

20 As will be described in more detail in connection with Figs. 5 to 9 and 11, tr 
fractionally spaced. MLSD may determine a sample branch metric value BM for eat 
of the L=2 samples and combines the L=2 sample branch metric values in order 
assign overall branch metric values BM to t to the symbol spaced transitions in tt 
trellis. . 

25 The branch metric BM to t is computed as the sum of the two sample branch metric 
as shown in Fig. 5 (cf. equations (2) to (4)). Fig. 5 shows a trellis of the (i-1) 
symbol period 51 and the ith symbol period 52,. the output signal of the AGC ?( 
and the rearranged sample values rVi and r\ 2 at a first sampling time 53 and secor 
sampling time 54. In the simplest case, which is shown in Fig. 7, the two samp 
30 branch metric .values are added, which neglects a possible correlation between tf 
two samples, for simplicity. ! : 
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in order to avoid the use of an explicit filter model as disclosed in US 5,31 3,495 and 
US 5,263,053, the channel estimation is based on decision-directed conditional 
quantized amplitude statistics, conditioned on a channel state as derived from the 
detected sequence. 

5 In compliance with the conventions for MLSDs, branch metrics are logarithms of 
transition probabilities. The branch metrics may be obtained from a complete set of 
channel-state-conditioned amplitude histograms. An amplitude histogram is a 
discrete amplitude probability mass distribution (or amplitude distribution, for short) 
conditioned on channel states at a given sampling phase. Consequently, a channel- 
10 state-conditioned histogram is the amplitude distribution under the condition that the 
channel is in a given channel state and sampled at a fixed sampling phase. In 
addition, an amplitude histogram may be conditioned on sample values obtained at 
different sampling phases. As will, be explained in connection with branch metrics . 
below, an amplitude histogram obtained at the second 7 sampling phase may be 
15 conditioned on the value obtained at the first sampling phase. The collection of such 
histograms for all possible channel states and all used over-sampling phases is 
called a (probabilistic) channel model : Sometimes it is necessary to distinguish the 
"complete" and the ^phase-specific" channel model . A phase-specific Channel model 
is the subset of a complete (probabilistic) channel model that is restricted to a given 
™ sampling phase. The complete channel model is the_complete set of phase-specific 
(probabilistic) channel models for all L samplers or sampling phases. 

Each histogram value Is actually a frequency f or counter value c for the occurrence 
of an event. An event Is defined by a channel state and one ore more quantized 
associated amplitudes or sampling values within a period of time. Frequencies and 
25 counter values may be assumed to be proportional to estimates for transition 
probabilities by the weak law of large numbers. As a consequence the branch 
metrics may be a logarithm In obtained from tne measurejrfrequencies-orcounter 
values by equation (1): 

BM(bTr)-^ln(c(b7r))— " : : < 1 ) 

30 Probabilities are normalized to one i.e. the sum of the probabilities of all possible 

_^ events is one. The co unter values could also be normalized to one b y dividing each 

~~ counter value by the sum of all counter values. However, tnis operation is not 
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necessary, since it decreases ail branch metrics by the same amount. For findir 
the most likely path only the differences between the branch metrics influence tt 
result For the same reason, the base of the logarithm (log, In, Id) and the different 
between frequencies and counter values being the accumulation period 

5 insignificant. There is a one-to-one relation between events and frequencies ar 
counter values. It is assumed that there is also a one-to-one relation betwet 
counter values and branch metrics in the embodiments disclosed in the followin 
However, this is not necessarily the case in general. Statistical information lit 
counter values may be obtained for different purposes than the branch metr 

10 calculation. To this end a larger number of events and corresponding count 
values than the number, of branch metrics may be. obtained. Before taking 
logarithm, the values of counters belonging to a subset of counters may be adde 
There may be more than one subset of counters. . 

Due to the one-to-one relationship between counter values' or frequencies ar 
15 branch metrics, counter values, frequencies and branch metrics may be arranged 
a similar fashion and stored in similar data structures (of. Fig. 6 and 7). Moreovc 
due to the one-to-one relationship the branch metrics may be referred to as 
channel model. 

It must be made sure, that none of the frequencies or counter values of which tt 
*2olbg£Htnm according to equatfon (i) is taken' is equivalemTo"zerb7~Tfiis~mIy t 
performed by replacing 0s by low Values, by interpolation as explained in SauerOO ■ 
by fitting a model to the measured frequencies or counter values as shown in Fi 
12. 

A model distribution that is known to be appropriate for the channel in question (e. 

25 truncated Gaussian for noise limited links or truncated chi-square for optical 
amplified links) may be fitted to the measured histogram in step 82 after frequeni 
or counter values are measured in step 81. Then. the model distribution is evaluate 
in step 83 for the observed counter values or frequencies in order to obtain mod 
valuesr-T-he-usuaHog-likelihood metric is then determjnedHn-step^84-by-taking 

30 logarithm of each model vaiue. This has the advantage, that the model distributioi 
do not provide 0-probabiHties which causes difficulties when taking the logarithr 
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Then the process is repeated starting with the accumulation of counter or frequency 
values. 

It is possible to use only a subset of the detected symbols for channel estimation 
(sub-sampling) in order to trade-off complexity in particular high-frequency 
5 performance against acquisition and tracking speed. 

In the most specific embodiments, the branch metric for a transition is determined 
from L=2 samples tantamount to two-fold oversampling. Some of the embodiments 
can easily be generalized to L>2 samples. 

As described earlier, the simplest method is to treat the L samples per symbol as 
10 conditionally independent, when conditioned only on the same channel state. This 
leads to S N Q different events and corresponding counter or frequency values. The 
frequency values 63 may be arranged In table form as shown in Fig. 6. The 
frequencies for one channel state 62 corresponding to an amplitude histogram are 
arranged in one row, whereas the frequencies belonging to one sample value 61 are 
15 arranged in one column. The independence of the symbols leads to a sum of 
sample branch metrics BM(b,r) 64 where each metric for a given sample depends 
only on the channel state b (trellis transition) and on the sample value r. The sample 
branch metrics 64 may be arranged in a similar form as the corresponding 
- frequendes r -as.;smown = in=Sg~7. The overall branch metric. SM tot =is.calGulated=byr 
20 equation (2): 

BMtotfer! r,)= SBM(b,r,) < 2 > 

. 1=1 

In equation (2) and in the following time dependence on discrete time index i is 
suppressed where possible. However, in reality, the two samples associated with 
one symbol (bit) are correlafe^With-each-other^Moreovererg^noise-coloring-in-the- 
25 receiver and the fact that the real channel memory is actually larger than the model's 
channel memory (i.e. due to so called convolution noise) influence the correlation of 
samplesTUmi^tl^lSl=cauSed correlation between samples of^jacenrbitsTwhich 
is implicitly accounted for In the trellis diagram, this correlation is neglected in the 
simple realization above: by adding the metric values of the two samples, which 
30 co rr esponds to th e p Fe duct of their pr n h a hilities , the two-samples-are-treated as 
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stochastically Independent this simplification is sub-optimal because any existin, 
noise con-elation is not exploited. 

As shown in KavcicOO and Kavdc98. it is possible to take noise correlation ove 
several symbols into account 

5 ITT'.'" 8 ,raC,,0na " y ^ ° f * Ms inventon *° ** would be tc 

start w,m taking 0,e chelation be^een samples belonging to the J ^Mn 

account, which is expected to be even more signmcan, man the corretafion £2 
samples a, termer distance. „ is no, necessary to assume any speJTeg 
Gaussian, form of the noise process, as it is implicitly ecooumedT^e 
10 measured" probabilistic channet mode,: WKh. mese modlficafions. the add^ 
col ^ T!"^' W r 2 following a fire, sample r, is addifionally 
Z£T h d6Pendent ' n 1,19 «n 9 sample r„ In addition 

to ^ dependence on me channel state a and me va.ua of me sampfe r 2 Hseif The 

is rr rr t° 8Mw ,s eaiouw,ed as - * a ** « 

22Tb T ° °" W 8,3,6 " ^ 8,9 *"* ^«> te * °"« ■ — "d 
sample branch memo BM^.r.r,) depending in addition en'the second sampie r 2 : 

BM tol fe i r 1 ,r ? )=BM 1 (b,r 1 )+BM 2 (b,r 1 ,r 2 ) 

(3) 

torn, as shown ,n Flg. 8 . Reference numeral 65 refers to me f^sa^pielT^ 
^ «"*"* ~» numerals 68ahd 69, may be ar^ngeo 

,n a mree-d.mensiona, structure as shown in Fig. 9. This means mat second sample 
branch m e, c BM 2 for a specific firs, sample r, may be arranged in tabto form ££. 
1) omer tables 69 are necessary to form ,he completo mree^imensional sbuctore 
Reference, numeral 67 refers to me second sample r, In order to take ,he sampie 

as cordon ,n,o account i, is possible to -measure" for me second sample r 2 me 
ampktude dismbuflon conditioned on the channel state 62, me value of the second 
sample 67 and on me sample value of me first sample. This leads to a significantly 
.ncreased. DUm ^^ * 
of the second sample (Q amplitudes for me Are, sample times Q for «he~secon"d 7 

30 sampie as opposed to ,ust Q in ma simple scheme) and to a corresponding* longer 
accumulabon period for the sam* statistical significance. 
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In order to reduce complexity, events may be defined by a channel state b, the first 
sample value r1 and the second sample value r2 requiring Q 2 S N counters. Both, the 
counter values and the resulting branch metrics may be arranged in a three- 
dimensional structure as shown in Fig. 10. This means that sample branch metric 

5 BM for a specific first sample n may be arranged in table form 70. (Q-1) other tables 
71 are necessary to form the complete three-dimensional structure. The advantage 
of this procedure is that the overall branch metric can be immediately looked up 
without the need for an addition. Probably less important is that S N histograms 
equivalent to a table shown in Fig. 8 can be saved compared to the embodiment of 

10 Figs. 8 and 9. 

In order to further reduce the complexity of the approach illustrated in Fig. 8 and 9, 
the second sample value r 2 could be conditioned only on a more coarse-grained first 
sample value: Rather than distinguishing Q amplitude levels for the first sample, only 
Q* < Q amplitude levels could be distinguished for the first sample. The case Q'=2 
15 would be the minimum, corresponding to a "tentative hard decision" on the first 
sample. In this case the channel model size for the second sample is only doubled. 
This leads to 

BM to t(b,r 1 ,r2) = BM 1 (b,r 1 )+BM 2 (b,R(r 1 ),r 2 ) (4) 

where R is the additionah(coneeptual) quantizer that maps the Qpossible^amplitude=^ 
20 values into the Q'<Q possible coarse amplitude values. It may be implemented by 
simply taking Into account the most significant bit(s) of r 1; Also in this embodiment 
the second sample branch metric BM 2 for a specific first sample n may be arranged 
in table form 72. (Q*-1) other tables 73 are necessary to form a complete three- 
dimensional structure as shown in Fig. 11. 

_ ?R -nhv/ir>iifii yi _fiirther_ correlatio n schemes are conceivable, as e.g. one talking 
correlation between samples of adjacent symbols into account (discrete time index 
necessary for clarity). 

BM tot (b.ri[ilr 2 l^ <S 
In equation (5) discrete time index i was added for clarity. 
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Rather than using conventional branch metrics, which constitute logarithms ol 
transition probabilities, branch metrics could be used, which are proportional to the 
transition probabilities. In the latter case branch. metrics must be multiplied in order 
to find the most likely path. As a generic term for mathematical operations like 
5 adding and multiplying "combining" is used. 

in another embodiment a fractionally MLSD detects a fractional symbol for each 
sample provided by the ADC 13 which : performs L-fold oversampling. In this 
embodiment each channel state is defined by a sequence of h precursor fractional 
symbols, a current fractional symbol having m value- r/,, and j postcursor fractional 

10 symbols. In this embodiment the MLSD generates, detected data u„ at a frequency 
L times higher than the symbol frequency: Under ideal circumstances all over- 
sampled detected fractional data ii„, within, one- symbol period and consequently 
having the same i should be equivalent no matter which value I has When 
calculating the branch metrics from measured frequencies or cp'unter values a model 

15 may be used which takes into account that all fractional symbols belonging to one 
symbol should have the same value, « n order to enforce that all fractional symbols 
belonging to one symbol are identical, the conditional probabilities for all transitions 
between a first fractional symbol and a second fractional symbol belonging to the 
same symbol may be set to 0. if the first, and the second fractional symbols are 
. 20. different apd set to 1 otherwise. . V 

In another embodiment iritra-symbol transitions between different fractional symbols 
may be allowed. In this embodiment the MLSD 17 may be considered to provide soft 
decision results I. e. identical fractional symbols for more reliable symbols and 
differing fractional symbols for less reliable symbols. Actually a soft metric may be 
25 defined by the number of fractional symbols having a value of 1 and belonging to a 
current symbol divided by the oversampling factor L. The final decision about a 
symbol is up to the FEC decoder, which may reverse symbol decisions in any 
embodiment in order to reduce the BER from typically 1Cr< after the MLSD to below 
10' required for data" transmission. * 

30 The estimated "channel model" consists of a finite set of (S N -Q) branch metrics BM 
which my be arranged in table form as shown in Fig. 2b. There is one branch metric 
provided for every-channelTstate-and for every quantized data value rrat the-carrent- 
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symbol. The branch metrics may be stored in a two-dimensional array and 
addressed by two indices one ranging from 0 to S N -1 and designating the channel 
states, the other ranging from 0 to Q-1 and designating the quantized value of the 
current symbol. In another embodiment the branch metrics may be arranged in an 

5 (N+1)-dimensional array. The frequencies, counter values or branch metrics may be 
arranged in memory in a similar form as the branch metrics i. e. in a 2-dimensional 
or (l+1)-dimensional array. More specifically the frequencies 63, the sample branch 
metrics 64 and 66 may be stored in a two-dimensional array, whereas sample 
branch metrics 68, 69, 70, 71 and 72, 73 may be stored in a three-dimensional array 

10 for two-fold oversampling. 

In another embodiment all data structures may be stored in one-dimensional arrays. 
The index of the array element storing frequencies, counter values or branch metrics 
is obtained by concatenating the channel state and the sample vahlie(s). 

In yet a further embodiment, each symbol defining a channel, state may be used as 
15 an index in one dimension. The arrays are (N+1) dimensional or (N+L) dimensional. 

It is noted that from the frequencies or counter values which may be arranged in one 
or more data structures illustrated by Figs. 8 and 9, Fig. 10 or Fig. 11 the counter 
values of counters 14 to 21 of EP03004079.4 can be calculated by summing the 

.;. : frequencies f or counter- values .over-allrGhannel states b for each digitizedr valued 

20 of first sample (Fig. 8 and Fig. 10) and by summing the counter values over all 
channel states b and first fractional samples ry. for each digitized value r* u of second 
sample(Figs. 9, 10 and 11). More specifically, in the case of Fig. 8 and 9 the 
following equations (6) and (7) may be used to calculate the counter values count u1 
of counters 14 to 17 of EP03004079.4 and county of counters 18 to 21 of 
25 EP03004079.4: 

county -£fife»i) 

b 

county =XS f 2fe , V2)" 

PJ b 

In the case of Fig. 10 the following equations may be used: 



(6) 

(7)— 
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<»^*a -ZZ*&ri.*)' 

fj b w 



In the case of Fig. 1 1 the following equation may be used to calculate 



count 2ir2 : 



count 2.^ - Z ZfcfeRta).*) Mm 

R(pj) b ( 10 ) 

5 From the latter calculated values a population difference parameter may be 
calculated in compliance with EP03004079.4 for controlling SPA circuit 15 in order 
to optimize the sampling phase. 

In a similar fashion, by summing over all channel states and sampling phases if 
applicable for each digitized value r„ counter values count, for counters 51 to 54 of 
10 subsets S q of EP03009564.0 may be obtained from which a population difference 
parameter as described in EP03009564.0 may be calculated. In accordance with 
the disclosure of EP03009564.0 this population difference parameter may be 
m,n.mized that the receiver control node20 sets the AGC/VGA 12 to an appropriate 
..e. optimized amplification. In the case of the embodiment of Figs. 8 and 9 equation 
= i5.(1.1J>maybeused.. 

™*q- Z Zfi<br,)+ Z ZZf2<b,ri.r 2 ) 

In the case of Fig. 10 the following equation may be used: 

°° unt q - Z ZZ%r 1f r 2 ) + £ ZZ*'i.*) (12) 
In the case of Fig. 1 1 the following equation may be used: 

Assuming that the channel statistics is a correct model of the actual channel the 
branch metrics derived from-the-channel-model are used to detect the bit sequence:- 
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In order to track the channel, the sample values and the detected bit sequence are 
used to measure the channel state conditioned amplitude statistics, i.e. a new 
channel model. In order not to overload the control node 20 and at the same time to 
optimize tracking capability, several model-updating strategies may be used. In the 
5 simplest case the current channel model is used to detect the received bits for a 
period of time, called accumulation period 171 (see Fig. 13). During this 
accumulation period 171, new channel observations are made. After the observation 
period 171, the measured amplitude histograms are used to compute new branch 
metrics during a computation period 183. Finally, the new branch metrics are loaded 
10 into the MLSD and the cycle restarts with accumulation period 191. Between 
accumulation periods 171, 191 and computation period 183 transfer delays 182 may 
occur. The period during which no acquisition takes place may be designated idle 
period which comprises the transfer delays 182 and the computation period 183. 
This cycle is called update cycle 170 (iteration, period, or interval). 

15 To speed up the update of the branch metrics and to shorten the idle periods, the- 
calculation of the branch metrics may be performed in an interlaced manner as shown 
in figure 13. 

While the accumulation of count c k (b,ri,r 2 ) are accumulated in a accumulation period 
171 in period k the branch metrics BM k .i are calculated during calculation period 173 
WWsed ~6h counter values accUniOiale^ period k-1. In the 

following period k+1 the branch metrics BM k .i are used to detect the symbols and 
consequently to accumulate counter values Ck + i(b,ri,r 2 ) during accumulation period 
181. Simultaneously during computation period 183 the branch metrics BM k are 
obtained based on counter values Ck(b,ri,r 2 ). These branch metrics will be used for 
25 symbol detection during accumulation period 191. In this embodiment the idle 
periods 174 and 184 during which no accumulation is performed are significantly 
smaller than in an embodiment in which accumulation and metric compQtation _ are — 
performed consecutively. An update cycle 170 comprises two periods e.g. periods k 
and k+1 .In another embodiment old and new frequencies or counter values may be 
30~combined using a forgetting factor. That means that the old data are weighted by the 
forgetting factor and the new data are weighted by (1 -forgetting factor) before the 
weighted data are added to form the hew data. The same procedure may be applied 
to the branch metrics ratiie i than th e-freqtte ncioo or count e r value s the br a nch" 
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metncs are ca,cu,ated from. This saves resources since it is not necessary to sav, 
the old frequencies or counter values whereas the old branch metdcs have to b, 

ZTT'"* 8 ° Pera "° n ° ,,he MLS ° ^ "» l0 ^«"" >* a non-Hnea 
operation. However, only smell changes of the branch metrics are expected fron 

6 ,0 , UPdate ' ™ s *— - -PI*-" of a testing factor diracUy on ft. 

branch metrics. 

h .contrast to the embodiments ot this invention, known FS MLSE naoeivers based 
on filter models of channels (e.g. us 5.313.495 and US 5,263.053) update me 
channel parameters in symbol time which requires more circuit resources On the 
to after hand Known FS MLSOs are empioyed In celiuiar telephone syslemTwh^ dl 
not operate at high transmission rates. 

For channels impaired mainly by GVD and or PMD, channel acquisition ,s started 
from a starting channel model as shown In Fig. 15. These antfthe derived branch 
memos ana sufficient to acquire the conect channel model in a few update iterations 

d,spera,o„ patterns of isolated zeroes and isolated ones show a -ftreshold 
crossing- behavtor as shown in Fig. 14: e.g. tor tow dispersion, me maximum TZ 
response 161 to an isolated one is wen above a threshold of 0.5. wheroas for higher 
*^"°^ nd " P"'*° broadening the maximum of response 182 remains 

« below the threshold. Conseque^yWSarting His^h W * detected sequence' 
of 010 ,s chosen identical to a detected sequence of 101 as shown in Fig 15 

TZLn 889 T m hfe, ° 9ramS 38 alm °* ^mmewca they 

wiii ften conveme m the correct directton. The differam storting histograms for ea* 

channel state are shown for each in Figure 15, whera the arrows roughly indicate 
25 the mean value of each histogram type. . 

For more general channels, a set of channel models may be requlroa in order to 
ensure convergence of the acqulsiflon procedura. Such an acquisition procedure is 
rtlustrated ,n F,g. 16. A suitable set of channel models is provided in step 202 it can 
be-used e.g. in a try-and-error feshion-asHllustrated-bysteps 204 to 209 or based on 
so some auxiliary channel measurements e. g. based on next neighbor autocorroiation. 

The starting channel model can be identical for the L=2 sa mplino phases. This does 

not on v annlv tn «h Q ^^k^: * 17^1 Z - 



. . , urz wq iiipmiy tjiiases. misaoes 

not only apply to me embodiment of Figa. 6 and 7, acoonling to which identical 
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branch metrics are used for the first and second sampling phase, but also to the 
other embodiments of Figs. 8 to 11. If specific, non-symmetrical, starting channel 
models are used, it may be necessary to perform a try-and-error procedure for the 
L=2 different settings of the bit rearranging circuit, or to ensure a minimum (quasi-) 
5 continuous phase adjustment setting at the begin of channel tracking. 

Channel monitoring as illustrated by steps 204 to 209 may be performed as a part of 
the acquisition procedure in order to select an appropriate starting channel model. 
On the other hand channel monitoring can be an ongoing process during channel 
tracking in order to detect the need for a channel re-acquisition procedure. It is | 
10 based on several observables: 

• LOS: When the PI signals LOS, channel is considered lost. A re-acquisition 
procedure is started once LOS clears in step 206. 

• BER estimation: When the estimated BER is above a given threshold in step 
207, a channel re-acquisition is started. A new channel ;re-acquisition may be 

15 prevented, if a period of time t B ER since the previous reacquisition did not yet 

elapse. Before initiating the re-acquisition in step 204 a new starting channel 
model is selected in step 209. 

• Channel Model Verification: The histogram s of the channel model are 
monitored for pathological amplitude statistics in step 208. Before initiating 

20 the re-acquisition in step 204 a new starting channel model is selected in step 

209. Some examples of model insanity indicators are: 

■ Correlation between channel state 1 1 1 and 000 above a given threshold 

■ Mode of 1 1 1 histogram below given threshold 

■ Mode of 000 histogram above given threshold _— 

25 ■ Correlation of histograms with a uniform histogram above a given 
threshold " 

The (statistical) mode is the value where the probability distribution (histogram) has 
a maximum. The maximum of the 11 1 histogram is expecte d at medium to high 
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quantized amplitude level and the maximum of the 000 histogram at low amplitud, 
levels. A histogram is uniform, if all bins have value 1/Q. 

Optionally, signal statistics of the L samples per symbol are measured These car 
be used for channel monitoring or to estimate channel conditions e.g. to distinguish 
5 a high dispersion from a low dispersion case. In particular these are 

• Measured values of the sample autocorrelation function R(0), R(t/2) R(T), 
R(3T/2), relative to timing phase of first sample. E.g. for the case L==2 we have: 

w-Xf* (14) 

■V'^-X&fcB (15) 
1 ° RCT)-X&lhl+i] . ' (16) 

R(3T/2)»X,&[i>2D+l] 

k v 1 



• Population difference parameter as per E~P0§d64079.?~: 

• Uniformity parameter as per EP03009564.0 



15 In another embodiment, acquisition could further be achieved by a suitable explicit 
training sequence, especially employing critical or characteristic patterns such as 
.solated one (...00100...) or isolated zero (...11011...) I. e. 010 and 101 in the case 
of N=3. This could either substitute or aid the acquisition using predetermined 
starting, histog rams. Selection of starting histograms could be based on measured 

20 estimates of the sample autocorrelation function values R(T/2), R(T),~R(3T/2). 

Further modifications and variations of the present invention will be apparent to 
-toose-skilledJn the art in view of this description _Acconding..y,.this. description is to 
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be construed as illustrative only and is for the purpose of teaching those skilled in 
the art the general manner of carrying out the present invention. It is to be 
understood that the forms of the invention shown and described herein are to be 
taken as the presently preferred embodiments. 
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Claims 02 Jull 

1 . A method for channel estimation for an optical receiver comprising: 

digitizing(13) an analog signal (r (t)) representing a sequence of symbols ( di 
thereby associating one digital word <r f>1 , r„ 2 ) out of a plurality of possible digita 
words to the level of said analog signal at each sampling time (53 54)- a 
symbol period (51, 52) comprising at least two sampling times; each digita 
word corresponding to one out of a plurality of quantization levels (61 , 65, 67); 

determining (17) the most likely sequence (u.) of said symbol's^,); 

providing a branch metrics (64, 66, 68, 69, 70,71, 72^ 73); 

characterized by 

obtaining to said branch metrics (64, 66, 68, 69, 70, 71 72 73) from 
frequencies (63) of digita. words (r M , r„ 2 ) resulting from said digitizing (13) and 
the symbols of said most likely sequence (u,): 

2. A method for channel estimation comprising: 

digitizing(13)_an anaiog signal (F(t)) comprising a sequence of symbols (d.) 
thereby as^eialih^ brie digital word (r,,,, r„ 2 ) mof^^^^gm^o^s 
to the level of said analog signal at each sampling time (53. 54); a symbol 
penod (51. 52) comprising at least one sampling time; each digital word 
corresponding to one out of plurality of quantization levels (61 , 65. 67); 

determining (17) the most likely sequence (u,) said symbols (d,); 

counting (81) events; thereby obtaining a counter value for each event- each 
event being defined by a channel state (62) and a current digital word,' each 
channel state being defined by a pattern of symbols relative to a current 
symbol determined at the time of said current di gital word; 

characterized by 

fitting (82) a modeldistribution to said counter values; and 
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obtaining a branch metric (83, 84) on the basis of said fitted model distribution. 

The method of claim 2, characterized in that a symbol period (51, 52) 
comprises at least two sampling times (53, 54); 

The method of one of the claims above characterized by: 

counting each kind of events during said digitizing, each event being defined 
by a channel state (62) and a digital word out of said plurality of digital words; 
each channel state (62) being defined by a sequence of symbols; 

calculating a sample branch metric (64) for each kind of events; and 

calculating a branch metric by combining the sample branch metrics (64) for 
each digital word obtained at a sampling time (53, 54) during a symbol period. 

The method of one of the claims 1 to 3 characterized by:, ' 

« 

/ 

each symbol period comprising a first (53) and a second sampling time (54); 

associating a first digital word (r M , 65) at the first sampling time (53) and a 
second digital word (r il2 , 67) at the second sampling time (54) to said analog 
signal; 

countinga ^firstT^indsofeyents, each first kind of events being defined by a first 
channel state (62) and a first digital word; each first channel state (62) being 
defined by a sequence of symbols comprising the symbol to. which said first 
digital word is associated; 

counting second kinds of events, each second kind of events being defined by 
said first channel state (62), said first digital word and second digital word 
following said'fifsfcligitisrword; ; 

calculating a first sample branch metric (66) for each first kind of event; 

' calculating a second sample branch metric (68, 69) for each second kind ol 
event; 
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calculating a branch metric for a second channel state (62) and a third and 
fourth digital word by combining the first sample branch metric (66) for said 
second channel state and third digital word and a second sample branch 
metric (68, 69) for said second channel state, said third digital word and said 
fourth digital word; said second channel state (62) being defined by a 
sequence of symbols comprising a symbol to which said and third and fourth 
digital words are associated. 

6. The method of one of the claims 1 to 3, characterized by: 

each symbol period comprising a first (53) and a second sampling time (54), 

associating a fist digital word ( n , 1f 65) at the first sampling time (53) and a 
second digital (r,, 2 , 67) word at the second sampling time (54) to said analog 
signal; , 

counting first kinds of events, each first kind of events being defined by a first 
channel state (62) and a first digital word; each first channel state (62) being 
defined by a sequence of symbols comprising the symbol to which said first 
digital word is associated; 

grouping said possible digital words into groups of digital' words; associating a 
• coarse digifclwo^ / • 
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counting second kinds of events, each second kind of events being defined by 
said first channel state (62), the coarse digital word associated to said first 
digital word and the second digital word following said first digital word; 

calculating a first sample branch metric (66) for each first kind of events; 

calculating a second sample branch metric (72, 73) for each second kind of 
events; 

calculating a branch metric for a second channel state (62 ) and a third and 
fourth digital word, by combining the first sample branch metric (66) for said 
second channel state and said third digital word and a second sample branch 
metric (72, 73) for said second channel state (62), a coarse digital w ord 
associated with said third digital word and said fourth digital word; said second" 
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channel state (62) being defined by a sequence of symbols comprising the 
symbol to which said third and fourth digital words are associated. 

7. The method of claims 1.to 3, characterized by: 

each symbol period comprising a plurality of sampling times (53, 54); 

counting kinds of events, each kind of event being defined by a first channel 
state (62), and a digital word (n,i, r i(2 ) for each sampling time during a symbol 
period of a first symbol; 

each channel state (62) being defined by a sequence of symbols comprising 
the first symbol; 

. calculating a sample branch metric (70, 71) for each kind of events; and 

calculating a branch metric for a second channel state (62) and a second 
symbol by combining the sample branch metrics (7J0, 71) for said second 
channel state (62) and each digital word associated to said second symbol; 

said second channel state (62) being defined by a sequence of symbols 

* 

5 comprising the second symbol. 

. • f * ■ .• 

i 

8. The method of one of claims 4 to 7, characterized in that said sample branch 
metric Ts a~* logarithm' of "the "cotfrt of' the "respective T kind of events" and 

combining of sample branch metrics is performed by adding same; , 

9. The method of one of claims 4 to 9 referring to claim 1, further characterized 

»o by: 

fitting (82) a model distribution to the counts of each kind of events ; 

e^l0atmg"(83)-theTnodel distribution-for-each-kind of-events-in-order_to-obtain_ 

one model value for each kind of events; 

calculating-(84.)-said-sample-branch metric for each kind of-events-using-the— 
25 respective model value. 

10, The method of one of the claims above, further comprising: 
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recovering (14) the clock of the symbols; and 

delaying (15) said clock In order to minimize the bit error rate of said digitize, 
by optimizing the sampling times during said digitizing (13). 

The method as claimed in claim 10, wherein the delay of said clock is adjusted 
based on bit error rate estimates obtained from said most likely sequence (u.) 
of symbols (d,). 1 u 

The method as claimed in claim 11, wherein the delay of said clock is adjusted 
in order to maximize a population difference parameter. 

The method of one of the claims above, further comprising: 

adjusting the sampling times by delaying (15) said clock (33) by a delay being 
quasi^ontinuously adjustable within a range of half of a symbol period and 

performing discrete sampling time (53, 54) adjustment by rearranging (16) the 
sequence of digital words. 

The method of one of claims 2 to 1 3, characterized by: 

counting (181) each kind of events during a current accumulation period (k-1) 
said channel -state^62),being,determined on branch metrics being, calculated- 
(1 73) during a previous accumulation period (k); and 

calculating (183) branch metrics for determining the channel states during a 
following accumulation period (k + 2) while each kind of events is counted (181) 
during the current accumulation period (k+1 ). 

The method of one of claims 1 to 13, characterized by: 

calculating (183) the branch metrics for determining the channel states during 
a following accumulation period (k + 2) as the sum of the branch metrics for 
determ,ning.-the-channel_state-during the current accumulation _period_(k + 1)_ 
we.ghted by a forgetting factor plus the logarithm of the respective count of 
events obtained (171) during the previous accumulation period (k) weighted by 
one minus theforgettjng factor. 
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1 6. The method of one of claims 1 to 1 3, characterized by: 

calculating (183) the branch metrics for determining the channel states during 
a following accumulation period (k+2) as the logarithm of the sum of the count 
of events obtained during accumulation periods before the previous 
s accumulation period (k) weighted by a forgetting factor plus the count of 
events obtained (171) during the previous accumulation period (k) weighted by 
one minus the forgetting factor. 

17. The method of one of the claims above, characterized by: 

defining a channel state (62) as a sequence of binary symbols which may be 
10 represented by "0"or"1"; and 

setting the branch metrics for the channel states /dr isolated Os and 1s 
i.e.".. .11011..." and "...00100...", respectively, /to identical/ values when 
initializing the branch metrics. , 

18. The method of claim* 17, characterized by: 

15 defining a channel state (62) as a sequence of binary symbols which may be 
represented by "0" or "1"; and 

" setting the branch "mWicTwl^nneT^sMes symmetrical to a considered!" 
symbol b 3 i.e. bib 2 b3b 4 b5 and b 5 b 4 b3b2bi to identical values when initializing the 
branch metrics. 

20 19. The method of claim 18 characterized by: 

monitoring at least one of the following conditions: 

a loss ofsignal output"bY'men?hysical interface-(11)-is-being-cleared-(206)- 

a bit error rate in said most likely sequence (uO of symbols is still above a 
predetermined -threshold_(.20Z)-after_a-predetermined period of time after_the— 
25 initialization of said branch metrics; 

pathological amplitude statistics are determined (208) comprising one of the j 
following cases: j 
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correlation between a only-1 channel state (62) and an only-0 channel 
state (62) being above a predetermined threshold; 

mode of only-1 channel state (62) below predetermined threshold; 

mode of only-0 channel state (62) above predetermined threshold; and 

correlation of histograms with uniform distribution above a given 
threshold; 

initializing (204) said branch metrics if one of the conditions above occurs. 
The method of claim 19, characterized in that: 

the branch metrics are reinitialized (209, 204) with different values than the 
values used at the previous initialization. 

A symbol detector for an optical receiver comprising: 

an analog-to.digital converter (13) for digitizing* 13) an analog signal (r(t)) 
comprising a sequence of symbols (d,) thereby associating one digital word 
(r.,i. r,, 2 ) out of a plurality of digital words to the level of said analog signal at 
each sampling time (53, 54); each symbol period (51, 52) comprising at least 
two sampling times; sach^^ 
quantization levels (61 , 65, 67); 

a fractionally spaced maximum-likelihood sequence detector (17) connected to 
said analog-to-digital converter (13) for determining the most likely sequence 
(u,) of said symbols (d,); 

a channel model unit (19) connected to said maximum-likelihood sequence 
detector (17) in order to provide branch metrics (64, 66, 68, 69, 70, 71, 72, 73) 
to said, maximum-likelihood sequence detector (17); 

characterized in that — 



said branch metrics are obtained from frequencies (63) of digital words (r,, lt r if2 ) 
output by said analog- to-digital converter (1 3) and the symbols (u.) determined 
by said fractionally spaced maximum-likelihood sequence detector (17). 
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22. A symbol detector for an optical receiver comprising: 

an analog-to-digital converter (13) for digitizing(13) an analog signal (r (t)) 
comprising a sequence of symbols (d t ) thereby associating one digital word 
(r ( .i.ri,2) out of a plurality of digital words to the level of said analog signal at 
5 each sampling time (53, 54); each symbol period (51 , 52) comprising at least 
one sampling time (53, 54); each digital word corresponding to. one out of a 
plurality of quantization levels (61 , 65, 67); 

a maximum-likelihood sequence detector (17) connected to said analog-to- 
digital converter (13) for determining the most likely sequence (Uj)'of said 
10 symbols (di); 

• ./' 

* 

a channel model unit (19) connected to said maximum-likelihood sequence 
detector (17) for providing branch metrics (64, 66, 68, 69, 70^71, 72, 73) to 
said maximum-likelihood sequence detector (17) and for counting events; each 
event being defined by a channel state (62) and a current digital word; each 
15 channel state (62) being defined by a pattern of symbqls relative to a current 
symbol determined at the time of said current digital word; a counter value (63) 
being associated to each event; , ' 

characterized by in that . , 

said channel model unit (19) fits (82) a model distribution to said counter 
20 vaiues (63) and obtains (83, 84) a branch metric (64, 66, 68, 69, 70, 71, 72, 
73) based on said fitted model distribution. 

23. The symbol detector of claims 21 or 22, characterized in that said analog-to- 
digital converter (13) performs two-fold over-sampling. 

24. The symbol detector of one of claims .21 to 23, further comprising a clock 
25 recovery subsystem (14) connected to the input of said analog-to-digital 

converter (13) for receiving said analog signal (r (t)); said clock recovery 
subsystem (14) for recovering the clock of the symbols; 

a sampling phase adjustment circuit (15) connected to said clock recovery 
. subsystem (14) for delaying-said-GlQGk^in-order-to minimize the bit error rate of — 
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said symbol detector by optimizing the sampling times of said analog-to-digital 
converter (13); said clock recovery subsystem (14) being connected to said 
analog-to-digital converter (13) for providing said delayed clock to said analog- 
to-digital converter (13). 
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EPO- Munich 

Abstract . U Mi 2003 

The. invention relates to a method for channel estimation. The method comprises 
digitizing an analog signal representing a sequence of symbols thereby associating 
one digital word out to the level of said analog signal at each sampling time. The 
5 most likely sequence of said , symbols is detected. To this end a branch metrics is 
provided. According to one embodiment a symbol period comprises at least two 
sampling times. Moreover the branch metrics are obtained from frequencies of 
digital words resulting from a digitizing and the symbols of the most likely sequence. 
According to another embodiment a symbol period comprises at least one sampling 
10 time. Events are counted wherein each event is defined by a channel state and a 
current digital word., Each channel state is defined by a pattern of symbols relative 
to a current symbol determined at the time of a current digital word. A model 
distribution is fitted to event counts and a branch metritis is obtained from the fitted 
model distribution. Moreover the invention relates to corresponding symbol 
15 detectors for optical receivers. 
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BM^ (b^p^) = BM(b,r 1 )+BM(b ,r 2 ) 
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BMtot^r^r^) = BMCkr^) 
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